Olga - a conversational agent with gestures Multimodal Agent Interfaces

نویسنده

  • Scott McGlashan
چکیده

The Olga project has developed an animated agent interface for information services. The interface combines a graphical interface, spoken dialogue and an animated 3D ‘human-like’ character for multimodal interaction with users. The interaction is intelligently managed using techniques derived from spoken dialogue but extended for the graphical modality. The Olga agent is innovative in combining an interactive spoken dialogue system with a 3-D animated character using lip-synchronized synthetic speech and gesturing. Particular attention has been paid to ensuring that the behaviour of the agent is immediately comprehensible for the user. Synchronizing speech with mouth movements increases intelligibility, while facial expressions and gesturing realize the agent’s internal states and focus of the dialogue. can express feedback either independently or in conjunction with speech. By representing the agent as an animated character we gain access to these modalities so as to give the system a more apparent personality, hopefully making the experience more like a human interaction. However, it is essential that there is a reasonable match between the agent’s capabilities and its realization: the more human-like the realization, the more users expect it to have human-like capabilities in, for example, speech and language understanding. Consequently, while our animated character is realized with lip-synchronized speech and gesturing, it is also cartoon-like so as to elicit behaviour appropriate to the current state of interface technology.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Olga - a conversational agent with gestures

The Olga project has developed an animated agent interface for information services. The interface combines a graphical interface, spoken dialogue and an animated 3D ‘human-like’ character for multimodal input and output. The interaction is intelligently managed using techniques derived from spoken dialogue but extended for the graphical modality. The Olga agent is innovative in combining a spo...

متن کامل

Head gestures for perceptual interfaces: The role of context in improving recognition

Head pose and gesture offer several conversational grounding cues and are used extensively in face-to-face interaction among people. To accurately recognize visual feedback, humans often use contextual knowledge from previous and current events to anticipate when feedback is most likely to occur. In this paper we describe how contextual information can be used to predict visual feedback and imp...

متن کامل

Fusion of Children's Speech and 2D Gestures when Interacting with 3D Embodied Conversational Characters

Most of the existing multimodal prototypes enabling users to combine 2D gestures and speech are task-oriented. They help adult users to solve particular information tasks often in 2D standard Graphical User Interfaces. This paper describes the NICE HCA system which aims at demonstrating multimodal conversation between humans and embodied historical and literary characters. The target users are ...

متن کامل

Timing and Rhythm in Multimodal Communication for Conversational Agents

Synthesis of lifelike gesture is finding growing attention in human-computer interaction. In particular, synchronization of synthetic gestures with speech output is one of the goals for embodied conversational agents which have become a new paradigm for the study of gesture and for human-computer interface (Cassell et al., 2000). Embodied conversational agents are computer-generated characters ...

متن کامل

KI-04-2003-Embodied-Conversational-Agents fürs web.PMD

In the following, it will be shown how these systems can be applied as a means to generate the body movements of Embodied Conversational Agents. The next section describes the CoGesT transcription system in more detail. Section 3 gives an overview of multimodal corpus creation in the TASX-environment and the synthesis of conversational gestures using Lokutor. In the final section these systems ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997